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Abstract. We consider a model for a one-sided limit order book proposed by 
Lakner et al. [9] and show that it can be coupled with a branching random 
walk. We then use the coupling to answer non-trivial questions about the long- 
term behavior of the price. The coupling relies on a classical idea of enriching 
the state-space by artificially creating a filiation, in this context between orders 
of the book, that we believe has the potential of being useful for a broader class 
of models. 

1. Introduction 

Limit order books. A limit order book is a financial trading mechanism that 
keeps track of orders made by traders, and allows to execute them in the future. 
Typically, a trader places an order to buy a commodity at a certain level x. If the 
current price of the commodity is larger than x when the order is placed, then the 
order is kept in the book and may be fulfilled later in the future, as the price of the 
commodity fluctuates and falls below x. Similarly, traders may place sell orders, 
which gives rise to two-sided order books. Because of the importance of limit order 
books in financial markets, there has been a lot of research on these models, see for 
instance the survey by Gould et al. [5] . 

There are many variants of limit order books, depending for instance on which 
information of the book traders have access to. Typically, traders have only access 
to the current so-called bid and ask prices, that correspond to the lowest sell order 
and the highest buy order, respectively. This gives an incentive for traders to place 
orders in the vicinity of these prices, thus creating an intricate dynamic where 
the state of the book influences its evolution. Stochastic models capturing this 
dynamic have for instance been proposed in Cont et al. [I], Lakner et al. [5] and 
Yudovina |10j . In the present paper we study the one-sided limit order book model 
of Lakner et al. [9] , and our goal is to show how some properties of this model can 
be efficiently studied thanks to a coupling with a branching random walk. 

From a high-level perspective, this coupling adds a new dimension to the initial 
limit order book model by creating a filiation between the orders. Such ideas have 
been extremely successful in queueing theory, see for instance Kendall [7], and we 
believe they can also be useful beyond the context of the model proposed here. For 
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instance, the model proposed by Yudovina [10] is also amenable to a tree represen- 
tation, but the corresponding dynamic on trees is more challenging to analyze than 
the one here. 

Branching random walks. The Galton Watson process is the simplest model 
of branching process. It represents the size of a population that evolves in discrete 
time, where at every time step each individual dies and is replaced by a random 
number of offspring, see for instance Athreya and Ney [T] for more details. A 
branching random walk is an extension of a Galton Watson process that adds a 
spatial component to the model. In addition to the genealogical structure given 
by the Galton Watson process, each individual has some location, say on the real 
line R, that is given by a random displacement of her parent's location. Branching 
random walks can therefore be represented by trees with labels on the edges: the 
structure of the tree represents the genealogy of the underlying Galton Watson 
process, and the labels on the edges represent the displacement of the child with 
respect to her parent's location. In this paper we will consider the simplest model 
of branching random walks, where labels on the edges are i.i.d., and will use results 
by Biggins [2] and Biggins et al. [3] to study the limit order book model. 

Acknowledgement. The author would like Josh Reed for introducing him to 
the limit order book model of the present paper, and Elic Aidckon for interesting 
discussions on branching random walks. 

2. One-sided limit order book model 

2.1. Model. Let us define a book as a finite point measure on R and an order as 
a point of a book. Let B be the set of books. For a book j3 £ B, let \j3\ be its mass 
(i.e., the number of orders it contains) and 7r(/3), which we call the price of the 
book, be the right cndpoint of its support, i.e., the location of the rightmost order: 

tt(/3) = max /3{{x}) > 0} , (3 £ B. 

When the book is empty, i.e., |/3| = 0, the value of the price is inconsequent 
for the purposes of this paper, say for instance tt(/3) = 0. Fix p e [0, 1] and a 
real- valued random variable X. We are interested in the B- valued Markov chain 
(B n ,n > 0) with the following dynamic. 

If the book is empty, the process goes to state Sq in the next time step, where 
here and in the sequel 8 X stands for the Dirac mass at x € R. If the book is not 
empty, a coin with bias p is flipped. If heads (with probability p) , an order is added 
to the current book at a random distance distributed according to X from the 
current price of the book, independently from everything else. If tails (thus, with 
probability 1 — p), an order sitting at the current price is removed. Formally, the 
Markov chain (B n ) has the following dynamic: for any /3 6 B and any measurable 
function / : B — > [0, oo), 

(1) E[f(B n+1 ) | B n =0\ = f(So)l m=Q} +P& [/(/3 + **(/>)+*)] M\P\>o} 

+ (1 <W))1{|£|> }- 

As explained in the introduction, this model (in continuous-time, and with a 
different boundary condition) has been proposed by Lakner et al. [9] to model 
a one-sided limit order book. The interpretation of the model is as follows: B n 
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represents the state of a one-sided limit order book with only buy orders. In each 
time step, either a trader places a new buy order (with probability p), or a trader 
places a market order (to sell the commodity, with probability 1 — p). In the 
latter case, the trader sells the commodity at the highest available buy order, thus 
removing one order sitting at the price. From this perspective, the behavior of the 
price process (7r(_B„), n > 0) is of primary interest. In this paper, we show how one 
can answer questions related to the price process by coupling (B n ) with a branching 
random walk. Our coupling can be used to answer more elaborate questions on this 
particular model, and we believe that it also has the potential to be applied to other 
models. 

Remark 1. To be more specific, in order to recover the model of Lakner et al. [9] 
one needs to apply an exponential transformation to (B n ), i.e., to consider the 
process B n — J2 x eB 7l This transformation makes the orders live on (0, oo), in 
which case the interpretation of ir as a price is reasonable. It also makes the price 
increase in a multiplicative rather than linear fashion, which is a common behavior 
in mathematical finance (e.g., geometric Brownian motion). We prefer to state the 
model on the line with linear displacement because of the analogy with branching 
random walks. 

2.2. Price dynamic. The behavior of the price is asymmetric due to the system's 
dynamic. On the one hand, the price increases when an order is added to the 
right of the current price, and so an increase of the price is distributed according 
to X given that X > 0. On the other hand, the price decreases when an order 
is removed from the book, in which case the decrease of the price depends on the 
distance between the price and the second rightmost order. In particular, orders to 
the left of the price act as a barrier that slow down the price as it wants to drift 
downward. 

Thus, although EX < seems at first a natural condition for the price to drift 
to — oo, it seems plausible that if p is sufficiently close to 1, there will be so many 
orders sitting to the left of the price that they will eventually make the price drift 
to +oo. This intuition turns out to be correct as Theorem [T] below shows. 

This kind of behavior is strongly reminiscent of the behavior of extremal parti- 
cles in branching random walks. There, although a typical particle drifts to — oo 
when EX < 0, one may still observe atypical trajectories due to the exponential 
explosion in the number of particles, see the classical references by Hammersley [6], 
Kingman J8J and Biggins [2]. This analogy has actually been our initial motiva- 
tion to investigate the relation between (B n ) and branching random walks. And 
indeed, we will show in Theorem [2] that B n can be realized as some functional of 
a branching random walk, and this connection will make the proof of Theorem [1] 
quite transparent. 

Theorem 1. Assume that p > 1/2 and that EX exists in (—00,00). 
If EX > 0, then n(B n ) —t +00 almost surely. 

Else, assume in addition to p > 1/2 that EX < and that ¥(X > 0) > 0, and 
let a = infg>o E(e ex ) G (0, 1]. If p > 1/(1 + a), then ir(B n ) +00 almost surely, 
while if p < 1/(1 + a) then 7r(_B„) — > —00 almost surely. 

Remark 2. Let <p(9) = E(e 0X ) for 9 > and assume that EX < 0. We will 
use the following dichotomy: either <p(9) = +00 for every 9 > 0, in which case 
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a = 1; or tp(9) < +00 for some 9 > 0, in which case a < 1 due to the fact that 
ip' (0) = EX < in this case. 

Note that the price process is recurrent if p < 1/2 since (\B n \,n > 0) is a random 
walk reflected at 0, so the previous result gives a full picture of the price behavior 
(except for the boundary cases EX = and p = 1/(1 + a)). It is interesting to 
observe that if p > 1/2, EX < 0, P(X > 0) > and X has a heavy right tail, in 
the sense that <p(9) = +00 for every 9 > or equivalently, the random variable 
max(X, 0) has no finite exponential moment, then the price will always diverge to 
+00, irrespectively of the values of p and EX. Although the fact that exponential 
moments play a key role is clear from a branching process perspective, we find it 
more surprising from the perspective of the limit order book. 

3. Coupling with a branching random walk 

3.1. Intuition. From the book process (B n ), one can construct a genealogical 
structure by making an order x a child of some other order y if x was added 
to the book at a time where y corresponded to the price of the book, i.e., x is 
added at a time n where y = Tr(B n ). Since there is each time a probability p of 
adding an order to the book, it is intuitively clear that this construction will give 
each order (at most) a geometric number of offspring. By labeling the edge between 
x and y with the displacement x — y, which has distribution X, we end up with 
a Galton Watson tree with geometric offspring distribution and i.i.d. real-valued 
labels on the edges, i.e., a branching random walk. The idea of the coupling is to 
reverse this construction and to start from the branching random walk to build the 
book process (B n ). To do so, we will essentially realize the process (£>„) as the 
iteration of a deterministic tree operator $ on a random tree, thus encoding all the 
randomness in the tree. 

Nodes of the tree represent orders of the books, and in order to distinguish 
between orders that are currently in the book, orders that have been in the book 
and removed, and orders that have not been in the book so far (but may be later) 
we consider trees where nodes have one of three colors: green (orders currently in 
the book) , red (orders removed from the book) and white (orders not added in the 
book so far). We also consider trees with real- valued labels on the edges: then, 
each node is also given a label by summing up the labels on the edges from the root 
to the node, the root having label 0. The label of a node represents the position 
of the corresponding order in the book. Then, the green node with largest label, 
say 7, represents the order at the current price, and so we will run the following 
dynamic on trees: 

• if 7 has at least one white child, then its first white child becomes green; 

• if 7 has no white child, then 7 becomes red; 

• if the tree has no green node then we need to draw a new random tree. 

3.2. The coupling: notation and main result. Let T be the set of rooted trees 
where: 

• every edge has a real- valued label; 

• every node has one of three colors, green, red or white; 

• finitely many nodes are green or red, and all green and red nodes are con- 
nected; 

• the root is green or red. 
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We will need to compare the labels and colors of the nodes and edges of various 
trees. In that respect, it is convenient to consider V the set of all possible nodes and 
£ the set of all possible edges, and to denote by V(t) C V the set of nodes, £(t) C £ 
the set of edges and G(t),lZ(t) C V(t) the set of green and red nodes, respectively, 
of a tree t £ T. Nodes inherit labels as explained above, i.e., the label of a node 
is obtained by summing up the labels on the edges from the root to this node, and 
the root has label 0. If v £ V is a node and e £ £ is an edge, we denote by £(v, t) 
and £(e,t) the label of this node and edge in the tree t £ T, provided v £ V(t) 
and e £ £{t). We call genealogical structure of a tree t £ T the tree obtained 
from t when forgetting about labels and colors, and we say that t is a subtree of 
t' and write t C t' if the genealogical structure of t is a subtree of the genealogical 
structure of t' and i(e, t) = £(e, t') for every e £ £{t) C £(*'). For t £ T let T(t) £ B 
be the point measure recording the labels of the green nodes of t: 

ve9(t) 

Let T* = {t £ T : \G{t)\ > 0} be the set of trees with at least one green node. If 
t £ T* we denote by 7(t) the green node with largest label and by ui(t) the number 
of white children of 7(t). If there are several green nodes with maximal labels, 
we choose the first one (where in the sequel, nodes are ordered according to the 
lexicographical order). For t £ T and v £ V(t) we will more generally define to(v,t) 
as the number of white children of v in t, so that w(t) = w(7(£), t) for t £ T* ■ The 
following operator will create the dynamic of {B n ). 

Definition (Operator $). Let $ : T — >• 7" be the following operator: if \G(t)\ = 
then $(t) = t, while if t £ T* , then $ changes the color of one node according to 
the following rule: 

• if Lo(t) > 0, $ transforms the first white child of j(t) into a green node; 

• if u>(t) = 0, then $ transforms ^(t) into a red node. 

We also define as the nth iterate of $, defined by $o being the identity map and 
$„+i = $ o $„ for n > 0. Finally, we define «(t) = inf{n > : |£($„(i))| = 0} for 
t £ T, so that re(f) £ {0, 1, ... , oo}, as the first time where iterating $ on t creates 
a tree with no green node. 

We can now state our main result. 

Theorem 2. Let T be the following random tree: 

• the genealogical structure of T is a Gallon Watson tree with geometric off- 
spring distribution with parameter p; 

• labels on the edges are i.i.d. with distribution X, independently from the 
genealogical structure; 

• all nodes are white, except for the root which is green. 

Then (r($„(T)),0 < n < n(T)) is equal in distribution to (B n ,0 < n < r) started 
at Bq = So, where t = inf{n > : \B n \ = 0}. 

3.3. Proof of Theorem [TJ based on Theorem [2j Thanks to Theorem^ we can 
write B n = r($„(T)) for n < k(T) — r. The idea to prove Theorem[T]is, thanks to 
this representation, to translate the fact that the price drifts to ±oo into conditions 
on the initial tree T, and then to analyze these conditions thanks to classical results 
from branching random walk theory. 
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In the rest of this section we assume that p > 1/2. Since we are interested in 
the long-time behavior of the price which goes back to at time r in the event 
{t < +oo}, we consider everything in the event {t = k(T) = +oo}. It is not hard 
to show that k(T) = +oo if and only if T is infinite, which happens with positive 
probability since we are in the supercritical case p > 1/2. 

The case 7r(B n ) -> -co. Assume that EX < 0, P(X > 0) > and p < 1/(1 + a): 
we want to prove that ir(B„) — > — oo. Since p > 1/2 by assumption, we have 
in particular a < 1 and so there must exist r\ > such that E{e nX ) < +oo (see 
Remark [2] following Theorem Q} . 

Let M n be the rightmost point of the branching random walk T at time n, i.e., 

M n = ma,x{e(v,T) : v e V(T) and \v\ = n} 

where \v\ is the distance from v to the root. Under the assumptions made on X 
and p, Theorem 4 in Biggins [2 shows that M n — > — oo almost surely (in the event 
of non-extinction). Thus to conclude the proof of 7r(_B„) — > — oo, we only have to 
show that M n — > — oo implies that Tr(B n ) — > — oo. 

Let K > and n such that M n < —K for any n > n , i.e., £{v,T) < —K 
for every v G V(T) with \v\ > uq. Since there are only finite many nodes of T at 
depth < no, we must have |7($ n (T))| > no for n large enough and for those n, 
we have ^(7($ n (T)), $„(T)) < —K by choice of no. This proves that ir{B n ) — > — oo. 

The case n(B n ) +oo. Let q be the probability that the price remains non- 
negative at all times: q = P(ir(B n ) > 0,Vn). Observe that when a new order is 
placed in the book to the right of the current price, say at time n and at x > ir(B n ), 
this order defines the new price, and the probability that the price never falls back 
below x after time n is exactly q. Thus, if q > we can define a renewal sequence 
corresponding to the times at which a new order is added to the book, which defines 
a level above which the price stays forever. Then the renewal theorem implies that 
tt(B„) — > +oo. 

Thus, q > is a sufficient condition for ir(B n ) —> +oo. But to compute q, one 
may as well assume that orders placed on (— oo,0) are instantaneously removed 
since these orders only play a role when the price visits (— oo, 0). In terms of 
T, this amounts to removing all nodes with negative labels, as well as all their 
descendants. In more geometrical terms, this amounts to set a barrier at and kill 
every individual (and hence its descendants) of the branching random walk that 
crosses this barrier, which is precisely the model studied in Biggins et al. [3]. In 
this context, q is the probability that this branching random walk with a barrier at 
survives for ever. 

When EX > 0, it is clear that q > 0: indeed, any line of descent of T is 
a random walk with positive drift, which thus has positive probability of never 
visiting (— oo, 0). 

When EX < and P(X > 0) > 0, the situation is more delicate, but Theorem 1 
in Biggins et al. [3] asserts that q > if p > 1/(1 + a) and E(e nX ) < +oo for some 
rj > 0. Thus, to complete the proof of Theorem [TJ it remains only to prove that 
q > if EX < 0, P(X > 0) > and E(e ? ' x ) = +oo for every rj > (note that in 
this case, we are always in the case p > 1/(1 + a) since a = 1, see Remark [2]). We 
prove this by a truncation argument. Let Xk — min(X, K) and — infg E(e 0XK ): 
since a = 1 we have clk — ► 1 as K — >• +oo, and hence p > 1/(1 + ax) for K large 
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enough. For such a K, we have qx > 0, with qn the same probability q as above 
but when the displacements have distribution Xk instead of X. By monotonicity 
(which is obvious on the branching random walk with a barrier), this proves the 
desired result q > and finally concludes the proof of Theorem Q] 



4. Proof of Theorem [2] 

4.1. Study of an auxiliary tree-valued Markov chain. Theorem [2] is very 
intuitive. Unfortunately, the rigorous proof involves quite a lot of formalism, since 
we need to go into the details of the tree dynamic induced by iterations of <1>. In 
order to slightly reduce the notational burden, we will assume that X is a discrete 
random variable; it is just a matter of formalism to extend the proof below to the 
general case. Let us introduce the following tree operators: 

• for t £ T, T(£) is the tree obtained from t by deleting all white nodes; 

• for t £ Tn = {t £ T : \Q{t)\ > and u(t) > 0}, fi(t) is the tree obtained 
from t by turning the first white child of j(t) into a green node; 

• for t £ T* and A £ R, Q'(t, A) is the tree obtained from t by adding a green 
child to 7(i) with label A on the corresponding edge; 

• for t £ T* , ^(t) is the tree obtained from t by turning 7(i) into a red node. 

It will also be convenient to introduce the following subsets of T: 

• Tr is the set of finite rooted trees with only green or red nodes; 

• % = {t £ T : \G(t)\ > and u(t) = 0}; 

• 7o is the set of trees of which every node is white, except for the root which 
is green. 

Note that T = {t : \Q(t)\ = 0} U Tn U 7*. Let Y n = T($„(T)) with T £ % as 
in Theorem[2l We denote by to = T(T) the deterministic tree reduced to the root, 
which is green. The goal of this section is to prove that the process (Y n ,n > 0) 
defines a Markov chain started at to with the following dynamic: for any n > 0, 
any y,y' £ Tr (note that for any t £ % and any k > 0, T($fc(t)) £ Tr) and any 



\Y n+1 = y' | Y n = y) = 




if \Q(y)\ = and y' = y, 
x) if \g(y)\ >0<mdy' =W(y,x), 
if \g(y)\ > and y' = *(y), 
otherwise. 



To this end, we fix until the rest of this section n > 0, x £ R and yk £ Tr for 
k = 0, . . . , n + 1, and we aim to prove that 

(2) P(F fe =y fe ,0<fc<n + l) = 

{1 if \G(y n )\ = and y n+ i = y n , 

p¥{X = x) if \g(y n )\ > and y n+1 = Sl'{y n , x), 
1-p if \g(y n )\ > and y n +i = *(y n ) 

which will prove the Markov property of (Y n ) with the prescribed dynamic. Let 
us study the dynamic of (Y n ). According to the various definitions made, we have 
for any t £ T that $(t) is equal to t if \Q(t)\ = 0, to ft(t) if t £ Tn and to *(t) if 



s 
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t G 7*, SO that 

(T(t) if \G(t)\=0, 

T(*(t)) = ^ T(fi(t)) ifte7h, 

[t(*(*)) ifte7*. 

It is clear that if t G 7fj, then T(0(i)) = fi'(T (i), with £(t) the label on the 
edge between 7(4) and its first white child, while if t G 7*, then T(\I/(t)) = \&(T(t)). 
Thus the previous display can be rewritten as 

fx(t) if = 0, 

T(4(t)) = ^'(T(t)^(t)) if*e7h, 
and since & n +i(t) = we get 

fT(*„(t)) if ia(^»(*))i = 0, 

T($„ +1 (t)) = ^ n'(T(*„(t)), if $„(*) G To, 

[*(T($„(t))) if$ w (t)G7i. 

Since T does not affect the colors of green nodes, we have Q(t) = Q(T(t)) and in 
particular, = \Q(T($ n (t)))\. Plugging in the definitions of 7n and 7i, it 

follows that for any n > and any t G To, we have 

(3) T($ n+1 (t)) = 

fT(*n(*)) if|S(T($ n (t)))| = 0, 

^ fi'(T($„(t)), *($„(*))) if \g(?($ n (t)))\ > and w(* n (t)) > 0, 

[*(T(*„(t))) if |$(T(*„(*)))| > and w(*„(t)) = 0. 

This last equation shows that T($ n+ i(i)) is almost entirely determined by T($„(t)), 
up to the knowledge (hidden by the action of T) of whether 7(<& n (i)) has at least 
one white child in $„(£) or not, and the value £(Q n (t)) of the corresponding edge. 
Further, since Y n = T($„(T)), © leads to 

P (n = Vk, < fc < n + 1) = l{| 6 ( 2/ „)|=o, !/ „ + x= 2 / n }lP > fffc = W, < k < n) 

+ l{|0( 2 /»)l>o, H „ +1 =Q'( J / tl ,*)}P M$»CO) > 0, *(*»(T)) = x, Y k = y k , < k < n) 

+ M\9{Vn) I >0,!/„ +1 =*(y„)}P (w($n(T)) = 0, Ffc = tffc, < fc < fl) 

and so to prove ©, we only have to show that if \Q(y n )\ > 0, then 

(4) P (w($„(T)) > 0, €($ n (T)) = x, F fc = y k , < fc < n) 

= P P(X = x)F(Y k =y k ,0<k<n). 

This property is quite intuitive: given the history Y k for k < n does not give any 
information on the remaining number of white children of 7($„(T)) in $„(T), nor 
on the label on the edge between 7($ n (T)) and its first white child, if any. The fact 
that every node has a geometric number of offspring and that labels on the edges 
are i.i.d. should therefore imply ((4]). To formalize this intuition, we will prove the 
following result, from which one can readily deduce (j4j. For t G T, let in the rest 
of the paper rj(v, t) be the number of children of the node v G V(t). 
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Proposition. If \Q(y n )\ > and P (Yfc = yk,0 < k < n) > 0, then for any t e To 



T = y k ,0 < k < n ^=> y n C £ and r)(v,y n ) = rj{v,t) for every v G Tl(y n ). 

Proof. For £ G T, let in the rest of the proof a(t) = \Q{t)\ + 2\H(t)\ - 1, and recall 
that k(£) = inf{rt > : \Q($ n (t))\ = 0}. It is clear from the definition of $ that 
a($(£)) = o-(*) + l{|a( t )|>o}- Since $„(£) = £ for any n > if |£(£) = and a(t) = 
for £ G 7o, it follows that 



We break the proof of the proposition into two steps. 

First step. Fix some t € To and y G 7r- The first step of the proof consists in 
proving that the following conditions are equivalent: 



(ii) there exists t' G 7o such that T($ (T (j / )(t')) = y, y C t and r](v,y) = rj(v,t) for 
every u G 7Z-(y). 

Proof of (0)=>(ln]). Assume that Q holds, i.e., T($ cr ( !y )(<)) = y: we want to 
prove §u§. Then taking t' — t gives the existence of the desired t'. Moreover, 
since $(a) does not change the genealogical structure of a e T and T(a) only trun- 
cates a, we have T($„(t)) C t for any n > 0, in particular y C t. Then, consider 
any v £ lZ(y). Since all the nodes of £ except for the root are white, the color of v 
results from the successive applications of $ to £ G To- In particular, v being red 
in & a (y) (£) comes from the fact that at some point, none of the children of v were 
white, i.e., uj(v, $&(£)) = for some k < cr{y). Since $ does not create white nodes, 
this implies u)(v, <& a r y -\(t)) = and since T conserves all non- white nodes, v has as 
many children in $ CT ( y )(£) as in T($ ct(3/) (£)) = y, i.e., rj(v, $ . (y) (£)) = i](v,y) which 
gives i](v,t) = T]{v,y). 

Proof of jn])=>([i]). Assume that (JuJ holds: we want to prove So in the rest 
of the proof, consider some £' G To such that T($ CT (j,)(£')) = y, and assume that 
y C £ and r](v,y) = T](v,t) for every v G TZ(y). We prove that y = T($ CT ( y )(£)) by 
induction on o~(y). 

If a{y) = 0, then on the one hand, T($ CT (j,)(£)) = £o while on the other hand, 
\G(y)\ + 2\H(y)\ -1 = implies \Q(y)\ = 1 and \TZ{y)\ = 0, so that y = £ . Thus 
y = T($ CT (j,)(£)) when o~(y) = 0, which initializes the induction. 

Assume now that a(y) > 1. Then a(y) = <r(T($ .( !/ )(£ / )) and so (O implies 
that a(y) < «;(£'). Define y' = T($ (j(2/) _ 1 (£')): since <r(y) - 1 < «;(£'), © implies 
that cr(y') = cr(y) - 1, i.e., a(<^ a ^ y )(t')) = 1 + a(^ a ^_ 1 (£')). This last equality 
means that applying $ on ( I > CT (j,)_ 1 (£') creates a green node or changes a green 
node into a red one, meaning in every case that \G{^ a (y)-i(t / ))\ > and since 
Q^a(y)-i{t')) = a(T($ CT(y) _ 1 (£'))) = Q{y') this finally means that |S(y')l > 0. In 
view of y = T($ (r (j / ) (£')), y' = T((f> cr ( y )_ 1 (£')) and ((3]), we therefore only have two 
possibilities: 



we have 



(5) 



V£ G To, a(T($ fe (£))) = k 



k < k(£). 



(i) T($ CTfe) (£))=y; 
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In either case, we have y' C y and since y C t by assumption, this gives y' C t. 
Moreover, the action of f2' is to add one green node to 7 and the action of VP is to turn 
7 into a red node, so that in either case we have TZ(y') C 7t(y) and r)(v, y') = r](v, y) 
for every v £ lZ(y'). Since r](v,y) = T)(v,t) for every v £ TZ(y) by assumption, this 
implies that r)(v,y') = r](v,t) for every v £ TZ{y')- Since finally y' = T($ .( 3/ /)(t')) 
and a(y') = cr(y) — 1 < u(y), we can therefore invoke the induction hypothesis 
to deduce that T($ CT ( y ')(t)) = y' = T($ (T ( 1/ ^(t')). In particular, and 
^<t(3/')(*') have the same set of green and red nodes and since \G(^ a (y')(t'))\ > 
this shows that \G(® a ( y ')(t))\ > 0. Then, flS) shows that 

Tf$ ,,u = fn'(T(4 ff(l , (i)),i(#^)(t))) if > 0, 

1 CT(y)UJ \*(T($ a(j/0 (t))) if W ($ ff(BO (i)) = 0. 

Since 2/ = T($ cr (j / /)(t)) this can be rewritten as 

TV* ^ ifw(*«r(vo(*))>0. 
H«MwWJ if w ($^ ) (t)) = 0, 

and in view of ([6]), the proof of y = T($ £r ( J/ )(t)) will be complete if we can show 
the two following implications: 

u($ aW) {t')) > > and (t)) = l(*„(y>)V)) 

and 

w($ <r(s /j(t')) = 0=*-w($ <y(w / ) (t))=0. 
To prove these two implications, we will use the identities 

(7) u J (^ {y , ) (t'))=r 1 ( 1 (y'),t')~v(l(y'),y') 
and 

(8) ^ aW) (t)) = v ( 7 (y%t) - v(W),y') 

that come from the fact that T(& a t y n(t')) = T(& a t y n(t)). 

Direct implication: assume that w($ .( 1 ,/)(t')) > 0. In this case, applying $ 
to <&<r( v ')(tf) adds a green child to 7(2/) in ^<j-(j/)(0) an d so viliv'), J/) = 1 + 
V(l(y )>y')- Since 2/ C i by assumption, this gives r)(j(y'),t) > i](j(y'),y') 
which proves that w($ CT ( 9 ')(t)) > in view of ((?]). Moreover, the equal- 
ity Viliy'), T($ ff(l ,/}(f))) = 77(7(y'),T($ CTfe0 (t))) means that 7(2/') has as 
many green and red children in $o-(j/') (^) than in (/). In particular, 

applying $ to these two trees adds the same node, say v, to each tree. 
Then, the equality ^(^WyoW) = ^(^<r(y')(*')) comes from the fact that the 
label on the edge between 7(2/') and v is the same in t and t' due to the 
inclusion y = T($ cr ( y ) (t')) C i; 

Reverse implication: assume that uj($> a ( y i\(t')) = 0. In this case, apply- 
ing $ to $a(y')(t') turns 7(2/') into a red node in $ <J .( J/ j(t / ), i.e., 7(2/') 6 
7?.(2/) and in particular, 17(7(2/')) 2/) = v(l(y')jt) by assumption on j/- On 
the other hand, $ did not change the genealogical structure of j/' and so 

viiiy'), y) = vh(y'),y') wm ch shows that 77(7(2/), ?/') = vdiy')^)- This 

implies o;($ CT ( a /) (/)) = in view of flSJ. 
This concludes the proof of the first step. 
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Second step. We now prove the proposition. Assume in the rest of the proof 
that \Q(y n )\ > and P(Yfe = Vk,0 < k < n) > 0. The direct implication is rather 
straightforward: if T($fc(f)) = yk for every k = 0, . . . ,n, then for k = n the result 
of the first step implies that y n £ t and T](v,y n ) = v{ v >t) f° r every v £ lZ(y n ). 

Let us now prove the converse implication, so assume that y n C t and 7]{y, y n ) = 
r](v,t) for every v £ TZ(y n ). The goal is to prove that T($fc(i)) = yk for every 
k = 0, . . . , n. Since P (Yj, = < k < n) > 0, there exists t' £ To such that 
yk = T($fe(i')) for every k = 0, . . . ,n. Because $ does not erase nodes and never 
changes the color of a red node, we have yk C yk+i and 1Z(yk) C lZ{yk+i) for 
any < fc < n — 1. In particular, for every < k < n it holds that yk £ t and 
rj(v,yk) — T}(v>~t) for every t> 6 lZ{yk). Moreover, 3> cannot create green nodes 
starting from a tree with no green node, and so the condition |(?(j/n)| > implies 
that \G{yk)\ > for every k < n. Thus \G($k(t'))\ > and so k < «(t), so that © 
implies that <r(yk) = k, i.e., yk = T($ CT ( J/J .)(t / )). Thus all the assumptions of the 
first step are satisfied, and we deduce that yk = T($ (7 ( J/) , = T($fe(t)) for every 
k < n. This finally concludes the proof of the proposition. □ 

4.2. Proof of Theorem [2} By regeneration of (B n ) at r + 1 and in view of |T]), 
it is enough to show that (r(<E>„(T)), n > 0) is the Markov chain started at 5o with 
the following transition: for any j3 £ B and any measurable function / : B : [0, oo), 

)] If 1/3 1 >0} 

+ (1 -p)f(P - S^/3))1{\0\ >O }. 

Since T only depends on the green nodes, we have r($„(T)) = T(Y n ) and so we 
only have to show that r(y, i ) is the Markov chain with the prescribed dynamic. 
This is easily verified once one realizes that the a- algebras a(Yk,0 < k < n) and 
a(T(Yk),0 < k < n) are the same. Indeed, the inclusion a(T(Yk),0 < k < n) C 
o"(Yfc,0 < k < n) is trivial. For the reverse inclusion, note that from the sequence 
(r(Yfc),0 < k < n) one can recover the sequence (Yfc,0 < k < n). This can be 
proved by induction on n. For n — this is trivial, since Yq = to and r(Yo) = ^o- 
So assume it holds for n > 0, and let us prove it for n + 1. So assume that 
(r(Yfc),0 < k < n + 1) is known. Then by induction hypothesis, (Yfe,0 < k < n) is 
known. Moreover, ([3]) shows that there are only three possible cases: 

• either Y n+ i = Y n , in which case r(Y n+ i) = T(Y n ); 

• or Y n+1 = ft'(Y n ,X) for some Ael: in this case, T(Y n+1 ) = T(Y n ) + S x ; 

• or Y n+ i — fy(Y n ), in which case V(Y n+ i) — T(Y n ) — b\ for some Ael. 

Thus by comparing T(Y n +i) to T(Y n ) one can recover Y n+ i which shows that 
<j(r(Yfc),0 < k < n) C cr(lfc,0 < k < n). From this, one easily deduces using 
the Markov property of (Y n ) that (r(Y n )) is a Markov chain with the prescribed 
dynamic. The proof of Theorem [2] is therefore complete. 
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